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(Dated: April 9, 2005) 

We have performed a search for neutral Higgs bosons produced in association with bottom quarks 
in pp collisions, using 260 pb _1 of data collected with the D0 detector in Run II of the Fermilab 
Tevatron Collider. The cross sections for these processes are enhanced in many extensions of the 
standard model (SM), such as in its minimal supersymmetric extension at large tan/3. The results 
of our analysis agree with expectations from the SM, and we use our measurements to set upper 
limits on the production of neutral Higgs bosons in the mass range of 90 to 150 GeV. 



PACS numbers: 12.38.Qk, 12.60.Fr, 13.85.Rm, 14.80.Cp 



In two-Higgs-doublet models of electroweak symmetry ical Higgs bosons: two neutral CP-even scalars, h and 
breaking, such as the minimal supersymmetric extension H , with H being the heavier state; a neutral CP-odd 
of the standard model (MSSM) |l|, there are five phys- state, A; and two charged states, H ± . The ratio of the 
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FIG. 1: Leading-order Feynman diagrams for neutral Higgs 
boson production in the five-flavor scheme (top) and four- 
flavor scheme (bottom). 



vacuum expectation values of the two Higgs fields is de- 
fined as tan j3 = Vijv\ , where vi and v\ refer to the fields 
that couple to the up- type and down- type fermions, re- 
spectively. At tree level, the coupling of the A boson to 
down-type quarks, such as the b quark, is enhanced by 
a factor of tan/3 relative to the standard model (SM), 
and the production cross section is therefore enhanced 
by tan 2 (3 Q • At large tan /3, this is also true either for 
the h or H boson depending on their mass. 

For several representative scenarios of the MSSM, LEP 
experiments have excluded at the 95% C.L. a light Higgs 
boson with mass to/j < 92.9 GeV |3j. At hadron colliders, 
neutral Higgs bosons can be produced in association with 
b quarks, leading to final states containing three or four 
b jets. The CDF experiment at the Tevatron Collider 
performed a search for these events in data from Run I Q . 

Higgs boson production in association with b quarks in 
pp collisions can be calculated in two ways: in the five- 
flavor scheme |5j, only one b quark has to be present, 
while in the four-flavor scheme |6j, two b quarks are ex- 
plicitly required in the final state. Both calculations are 
now available at next-to- leading order (NLO), and agree 
within their respective theoretical uncertainties 0, @- 
Figure ^ illustrates these processes for h production 
at leading order (LO), and analogous diagrams can be 
drawn for the H and A bosons. 

In this Letter, we assume CP-conservation in the Higgs 
sector. The masses, widths, and branching fractions for 
the neutral Higgs bosons into bb pairs are calculated us- 
ing the CPsuperH program [sUlfJ. The current analysis 
is sensitive to tan/3 in the range 50 - 100, and depends 
on the Higgs boson mass. In this region of tan/3, the 
A boson is nearly degenerate in mass with either the h 
or the H boson, and their widths are small compared 
to the di-jet mass resolution. Consequently, we cannot 
distinguish between the h/H and the A, and the total 
cross section for signal is assumed to be twice that of 
the A boson. In the region of m,A from 100 to 130 GeV, 



all three neutral Higgs bosons can be degenerate in mass 
and produced simultaneously Nevertheless, the to- 
tal cross section still remains twice that of the A boson. 
Using data collected by the D0 detector from November 
2002 to June 2004, corresponding to an integrated lumi- 
nosity of about 260 pb^" 1 , we search for an excess in the 
invariant mass distribution of the two leading transverse 
momentum (pt) jets in events containing three or more 
b quark candidates. 

The D0 detector has a magnetic central tracking sys- 
tem surrounded by a uranium/liquid- argon calorimeter, 
contained within a muon spectrometer. The tracking sys- 
tem consists of a silicon microstrip tracker (SMT) and a 
central fiber track er ( CFT), both located within a 2 T 
solenoidal magnet [lj. The SMT and CFT have designs 
optimized for tracking and vertexing at pseudorapidities 
\ri\ < 2.5, where r] = — ln(tan(#/2)) and 9 is the po- 
lar angle with respect to the proton beam direction (z). 
The calorimeter has a central section (CC) covering up to 
« 1.1, and two end calorimeters (EC) extending cov- 
erage to I77I w 4.2, all housed in separate cryostats [l3j |. 
The calorimeter is divided into an electromagnetic part 
followed by fine and coarse hadronic sections. Scintil- 
lators between the CC and EC cryostats provide addi- 
tional sampling of developing showers for 1.1 < \rj\ < 1.4. 
The muon system consists of a layer of tracking detec- 
tors and scintillation trigger counters in front of 1.8 T 
toroidal magnets, followed by two similar layers behind 
the toroids, which provide muon tracking for \rj\ < 2. The 
luminosity is measured using scintillator arrays located 
in front of the EC cryostats, covering 2.7 < \rj\ < 4.4. 
The trigger system comprises three levels (LI, L2, and 
L3), each performing an increasingly detailed event re- 
construction in order to select the events of interest. 

The large cross section for multijet production necessi- 
tates a specialized trigger to maximize signal acceptance 
while providing reasonable rates. This trigger at LI re- 
quires signals in at least three calorimeter towers of size 
A77 x Aip = 0.2 x 0.2 (where ip is the azimuthal an- 
gle), each with transverse energy Et > 5 GeV; three 
clusters and H^ 2 > 50 GeV at L2 (H^ 2 = scalar sum 
of the L2 clusters Et with Et>5 GeV), and three jets 
with px > 15 GeV at L3. A total of 87 million events 
were selected off-line with one jet of pt > 20 GeV and 
at least two more jets with pt > 15 GeV. Jets are re- 
constructed using a Run II cone algorithm with ra- 
dius ATI = ^(Arj) 2 + (Atp) 2 < 0.5, and are then re- 
quired to pass a set of quality criteria. To be accepted 
for further analysis, jets with pt > 15 GeV must have 
1 77 1 < 2.5. The jet energies are corrected to the particle 
level using ry-dependent scale factors. Events with up to 
five jets are selected if they have a primary vertex posi- 
tion \z\ < 35 cm and at least three jets with corrected 
Pt > 35, 20, and 15 GeV. Depending on the hypothesized 
Higgs boson mass, the final selections are chosen to opti- 
mize the expected signal significance, defined as S/ \f~B~, 
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where S (B) refers to the number of signal (background) 
events. Jets containing 6 quarks are identified using a 
secondary vertex (SV) tagging algorithm. A jet is tagged 
as a 6-jet if it has at least one SV within A1Z < 0.5 of the 
jet axis and a transverse displacement from the primary 
vertex that exceeds five times the displacement uncer- 
tainty. Jets are 6 tagged up to |ry| < 2.5, although the 
b tagging is about twice as efficient in the central region 
(\r]\ < 1.1) because of the CFT coverage. The b tagging 
efficiency is w 55% for central 6-jets of px > 35 GeV, 
with a light quark (or gluon) tag rate of about 1%. 

Signal events were simulated using the pythia 0] 
event generator followed by the full D0 detector sim- 
ulation and reconstruction chain, pythia minimum-bias 
events were added to all generated events, using a Pois- 
son probability with a mean of 0.4 events to match the 
instantaneous luminosities at which the data were taken 
(1 — 6 x 10 31 cm _2 s _1 ). The bh events, with /i— >66, were 
generated for Higgs boson masses from 90 to 150 GeV. 
Reconstructed jets in simulated events were corrected to 
match the jet reconstruction and identification efficien- 
cies in data. The energy of simulated jets was smeared 
to match the measured jet energy resolution. The px and 
rapidity spectra of the Higgs bosons from PYTHIA were 
compared to those from the NLO calculation 5]. The 
shapes were similar, indicating that the pythia kine- 
matics are approximately correct. The simulated events 
were weighted to match the px spectrum of the Higgs 
boson given by NLO, resulting in a 10% reduction of the 
overall signal efficiency. 

Of all SM processes, multijet production is the major 
source of background. This background is determined 
from data by normalizing distributions outside of the 
signal region. As a cross-check, we also compare data 
with simulations. ALPGEN 0] is used to generate three 
samples of events for bbj and bbjj with j correspond- 
ing to up, down, strange or charm quarks, or gluons, 
and bbbb final states with generator-level requirements: 
p\ > 25 GeV, p T > 15 GeV, \r)\ < 3.0, and AH > 0.4 
between any two final-state partons. These selections 
do not introduce significant bias because the final sam- 
ple contains much harder jets, after the application of 
trigger and 6-tagging requirements. Samples of bbj and 
bbjj are added together, but the bbjj sample is weighted 
by 0.85 to match the jet multiplicity observed in doubly 
6-tagged data. The cross sections obtained from ALP- 
GEN are 8.9 nb, 3.9 nb, and 60 pb, for the respective 
three states. All other backgrounds are expected to be 
small and are simulated with pythia: pp^ Z(^bb) +\ets, 
pp^Zb, and pp-^tt. Cross sections of 1.2 nb, 40 pb |17j . 
and 7 pb are assumed, respectively. 

There are two main categories of multijet background. 
One contains genuine heavy-flavor (HF) jets, while the 
other has only light-quark or gluon jets that are mistak- 
enly tagged as 6-quark jets, or correspond to gluons that 
branch into nearly collinear bb pairs. Using the selected 



data sample, before the application of 6-tagging require- 
ments, the probability to 6-tag a jet is measured as a 
function of its px in three \r)\ regions. These functions 
are called "mis-tag" functions. They are corrected for 
the contamination from true HF events by subtracting 
the estimated fraction of bbj (J) events in the multijet 
data sample (1.2%), obtained from an initial fit to the 
doubly 6-tagged data. These corrected mis-tag functions 
are then used to estimate the mis-tagged background, by 
applying them to every jet reconstructed in the full data 
sample. 

In order to test the modeling of the mis-tag back- 
ground, the high statistics doubly 6-tagged data is com- 
pared to simulations first, before extrapolating to the 
triply 6-tagged background. The expected signal con- 
tribution to the doubly 6-tagged data is negligible. The 
comparison in invariant mass spectrum of the two jets 
of highest pr (not necessarily the two 6-tagged jets) in 
the doubly 6-tagged data with the expected background 
is shown in Fig. |J] The 6-tagging in this analysis does 
not distinguish between contributions from bottom and 
charm events. However, the efficiency for tagging a c-jet 
is known from simulations to be about 1/4 of that for 
tagging a 6-jet. Therefore, when two 6-tags are required, 
the fraction of ccj(j) events relative to bbj(j) events will 
be a factor of w 16 lower after tagging. We have es- 
timated the fractions of ccjj to bbjj prior to 6-tagging 
using the madgraph Monte Carlo generator 0. The 
ccjj cross section is 22% higher than bbjj for the same 
generator-level selections. Therefore, the contribution of 
ccj(j) in the doubly 6-tagged data sample is expected to 
represent about 8% of the events. Thus, when we refer 
to the bbj(j) normalization, it should be understood that 
approximately 8% of the events are from the ccj(j) pro- 
cess. After these corrections for ccj(j) events, the HF 
multijet processes are only a factor of 1.08 higher in data 
than predicted by ALPGEN. The shape of the estimated 
background agrees well with the data over the entire in- 
variant mass region. 

To estimate the background for triply 6-tagged events, 
the mis-tag function is applied to the non- 6-tagged jets 
in the doubly 6-tagged events. This provides the shape of 
the multijet background distribution with at least three 
6-tagged jets. This neglects any contributions from pro- 
cesses with more than two true 6-jets, such as from 6666 
and Z(^bb)bb production. However, the shapes of these 
backgrounds from simulations are similar to those of the 
doubly 6-tagged spectra, and their rates are small. The 
overall background normalization is therefore determined 
by fitting the leading two jets invariant mass spectrum in 
triply 6-tagged events outside of the hypothesized signal 
region to the estimated shape for triply 6-tagged back- 
ground. The systematic effect on the normalization of 
the background from any signal contributing outside the 
search window was studied and found to be small relative 
to other uncertainties, as described below. 
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FIG. 2: Fit of the invariant mass spectrum of the two leading 
Pt jets in the doubly 6-tagged data to a sum of backgrounds: 
mis-tags derived from data (dotted), bbj(j) (dashed), and 
other backgrounds (Z(— >66)+jets, Zb, it and bbbb) (dashed- 
dotted). 
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TABLE I: Signal acceptances for each set of criteria (in %). 
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FIG. 4: The expected and measured 95% C.L. upper limits 
on the signal cross section as a function of tua. The band 
indicates the ±1 sd range on the expected limit. Also shown 
is the cross section for the signal at tan /3 — 80 in the "no mix- 
ing" scenario of the MSSM, with the theoretical uncertainty 
indicated by the overlaid band. 



FIG. 3: Invariant mass spectrum of two leading jets in events 
with at least three b-tagged jets, estimated background, and 
the signal for a 120 GeV Higgs boson that can be excluded at 
the 95% C.L. 

The selections in this analysis can be grouped into trig- 
ger level, kinematic {pt, t), nj), where rij is the number 
of untagged jets, and &-tagging. Table [I] shows the accep- 
tances for each set of criteria made in the analysis, for six 
values of Higgs boson mass. The systematic uncertainty 
on signal acceptance is nearly independent of assumed 
rriA, and is dominated by the uncertainty on 6-tagging 
efficiency (±15%), followed by uncertainties on jet en- 
ergy scale, resolution and identification efficiency (±9% 
in sum). These uncertainties are calculated by repeating 
the analysis with each value changed by ± one standard 
deviation (sd). The systematic uncertainties correspond- 
ing to uncertainties in px distributions for simulated sig- 
nal at NLO, the integrated luminosity, and the trigger 
efficiency are found to be ±5%, ±6.5%, and ±9%, respec- 
tively. These uncertainties, added in quadrature, result 
in a total systematic uncertainty of ±21%. 

The accuracy in modeling the shape of the background 
distribution can be estimated from the /dof between 



the estimated background and the data. The statistical 
error associated with the uncertainty in the normaliza- 
tion of the background (from the fit outside the signal 
region) is multiplied by y/x 2 ' /dof '. The background un- 
certainty is estimated to be < 3%. The systematic un- 
certainty arising from the width chosen for the search 
window is evaluated by varying it from less than the reso- 
lution to ±1.8 sd, centered on the peak value. The result- 
ing change in background normalization is much smaller 
than from other sources of background uncertainties. 

A modified frequentist method is used to set limits on 
the production of signal [19J. The di-jet invariant mass 
distributions in triply &-tagged events of data, simulated 
signal, and the normalized background were used as in- 
puts. The value of tan (3 was varied until the confidence 
level for signal (CLs) was < 5%. Figure EH shows the 
data, background, and simulated signal at the exclusion 
limit, for tua — f 20 GeV. This is converted to a cross sec- 
tion limit for signal production in Fig.0J which also shows 
the expected MSSM Higgs boson production cross section 
as a function of niA for tan (3 = 80, and the median ex- 
pected limit with the background-only hypothesis along 
with its ± 1 sd range. The NLO cross sections and their 
uncertainties from parton distribution functions (PDF) 
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FIG. 5: The 95% C.L. upper limit on tan /3 as a function of tua 
for two scenarios of the MSSM, "no mixing" and "maximal 
mixing." Also shown are the limits obtained by the LEP 
experiments for the same two scenarios of the MSSM 3]. 



and scale dependence are taken from Refs. 0, Q . The 
MSSM cross section shown in Fig. 0] corresponds to no 
mixing in the scalar top quark sector [2(|, or X t = 0, 
where X t — A t — /icot f3, A t is the tri-linear coupling, 
and the Higgsino mass parameter /i = —0.2 TeV. We 
also interpret our results in the "maximal mixing" sce- 
nario with Xt = V6 x A/susy, where Msusy is the mass 
scale of supersymmetric particles, taken to be 1 TeV. 

Results for both scenarios of the MSSM are shown in 
Fig.[S]as limits in the tan /3 versus ttla plane. The present 
D0 analysis, based on 260 pb" 1 of data, excludes a sig- 
nificant portion of the parameter space, down to tan/3= 
50, depending on niA and the MSSM scenario assumed. 
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